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Abstract. We show that the global balance of information dynamics for general 
quantum measurements given in [F. Buscemi, M. Hayashi, and M. Horodecki, 
Phys. Rev. Lett. 100, 210504 (2008)] makes it possible to unify various 
and generally inequivalent approaches adopted in order to derive information- 
disturbance tradeoffs in quantum theory. We focus in particular on those tradeoffs, 
constituting the vast majority of the literature on the subject, where disturbance 
is defined either in terms of average output fidelity or of entanglement fidelity. 
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1. Introduction 

The general idea for which information extraction always causes disturbance is 
nowadays widely accepted as one of the most fundamental and distinctive principles 
of quantum theory as opposed to classical theory. However, without precisely 
defining what we mean with the vague words "information" and "disturbance" , such a 
statement is nothing but an empty sentence. In order to obtain some mathematically 
sound results, one first needs to provide a natural and meaningful way to measure 
both information and disturbance. Given these definitions, an information-disturbance 
tradeoff relation is then a lower bound on (some monotonic function of) disturbance 
given in terms of (some other monotonic function of) information gain, in such a 
way that the conclusion "disturbance is null only if information gain is null" can 
be drawn from it. With this at hand, a second order of problems is to identify 
least- disturbing measurements, namely, those measurements causing the minimum 
disturbance compatible with a given amount of information extracted. For these 
kind of "optimal" measurements then one would like to see that also the converse is 
true, that is, if information gain is null, then also minimum disturbance is null, thus 
establishing an equivalence relation between information and minimum disturbance. 
This is however beyond the scope of the present paper: our main concern will be to 
unify different and usually inequivalent ways to quantify information and disturbance, 
hoping that such a clarification could simplify the derivation of optimal measurements, 
which in general is an awkward, yet important, task. 

The setting of the problem is usually as follows: a letter x, drawn from an 
input alphabet X according to the probability distribution p{x), is encoded into some 
quantum state g^. Subsequently, a quantum measurement A4 (for the moment, let us 
think of the measurement as a kind of black box performing some fixed operation) is 
performed on the given state Qx, whose label x is unknown, producing a measurement 
readout letter m, belonging to the set X of possible outcomes, together with the 
corresponding reduced state gm\x- Given the input letter x, the probability of getting 
the result m is written as a conditional probability p{m\x). Then, information is 
usually understood as "information in m about a;" and disturbance as "how well one 
can undo the state change Qx i-^ gm\x for all possible couples (x,m) £ X x 

While information can be defined to be one of the many known and basically 
equivalent measures provided by classical information theory to quantify the input- 
output correlations given the joint probability distribution p{x, mjj, we have (at least) 
two in principle inequivalent — still both meaningful — ways to measure disturbance in 
a quantum scenario: the first one, adopted e. g. in Ref. [2], measures disturbance 
depending on "how close" the states gx and gm\x are, for all x and m, possibly after 
a correcting operation performed onto the output states. We tend to call this way 
the average output fidelity approach. The second way, that we call entanglement 
fidelity approach, adopted e. g. in Ref. [3l S], measures disturbance depending on 

t We notice that, since a quantum measurement process is defined by several multidimensional 
parameters, i. e. the source and the measurement apparatus, it is extremely demanding to require 
these latter to be meaningfully summarized by means of two positive parameters only, i. e. information 
and disturbance: the very definition of information and disturbance hence represents a major problem 
in itself. 

§ Different definitions of information gain could however lead, in principle, to different least- 
disturbing measurements. For example, it is known that, for a given ensemble of states, the 
measurement optimizing the mutual information is in general different from the measurement 
optimizing the minimum discrimination error: see the analysis of the two-states case in Ref. 
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"how reversible" , or, equivalently, "how coherent" is the transformation causing the 
state change, and it is related to the possibility of reliably transmitting quantum 
entanglement through the measurement apparatus. The second approach is generally 
much more stringent than the first one, as the following example shows. Let us 
consider the simple situation in which the input quantum states Qx are actually pure 
orthogonal states. Then, the von Neumann measurement projecting onto such states 
is able to fully extract the information about x, without causing any disturbance, 
when we understand it in the first sense (indeed, in this case, gm\x = Qx and 
p{x,m) — p{x)Sm,x)- On the other hand, a complete projection like a von Neumann 
measurement causes a sudden and complete decoherence of any input state onto the 
measurement basis: it is hence amongst the most disturbing state changes possible, 
if disturbance is understood in the second sense, since the associated state change 
completely destroys entanglement. (We will be more quantitative in the following.) 

In fact, there is a third remarkable way to define disturbance, and it is the 
one adopted by Ozawa in Ref. [5]. Ozawa analyzed the case of a subsequent 
measurement of two generally non commuting observables and obtained a universally 
valid uncertainty relation, in the spirit of Heisenberg [H] and Robertson [7|, as a 
formula explicitly involving noise, disturbance, and pre-measurement uncertainties 
(i. e. standard deviations in the preparation stage). However, we prefer to keep 
Ozawa's analysis in a separate class, since he describes the evolution in the Heisenberg 
picture, considering observables evolving in time, while quantum states are fixed. 
Therefore, the notion of disturbance Ozawa uses is about observables and cannot be 
straightforwardly translated into a notion of disturbance about states, which is the 
one usually adopted when dealing with quantum communication scenarios, like e. g. 
quantum cryptography. 

The question we address in this paper is: how can we reconcile the first two 
mentioned approaches to quantify disturbance, one using average-output-fidelity-like 
criteria, the other using entanglement-fidelity-like criteria? The motivation comes 
from a previous work of ours [4] where we derived the closed balance of information 
in a quantum measurement process by introducing a quantum information gain, 
which constitutes an upper bound to the information that the apparatus is able to 
extract, independently of how this information is encoded, and a quantum disturbance, 
which is related to the possibility of deterministically and coherently undoing the 
corresponding state change. Such a balance, by construction, incorporates a tight 
information-disturbance tradeoff where disturbance is understood in the second sense. 
In a way, quantum information gain and quantum disturbance defined in U are 
intrinsic features of the measuring instrument, much like the "technical specs" of the 
apparatus, independent of the source that is to be measured. Moreover, being a closed 
balance, the formula of Ref. [4j is able to explain the physical causes for the existence 
of the tradeoff, the latter being a direct consequence of the appearance of some 
hidden correlations between the input system (undergoing the measurement) and some 
inaccessible degrees of freedom (like e.g. the environment) . The approach we proposed 
in Ref. [1] hence seems to possess some advantages with respect to other approaches, at 
least in terms of compactness and generality. It has been unclear for a while, however, 
how to derive from it a general tradeoff relation also when disturbance is understood 
in the first sense, namely, when it is measured in terms of average output fidelity. 
The previous example involving orthogonal states and von Neumann measurements 
seems to stand as an insurmountable obstacle to this program. Nevertheless, we 
will prove that it is indeed possible to circumvent it by showing that, whenever the 
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input alphabet is encoded onto non orthogonal pure states (we will be more precise 
in the following), then the two notions of disturbance are equivalent, in the sense that 
when one is sufficiently small, the other is correspondingly small. Such a conclusion 
basically comes as a corollary of Lemma 3 in Ref. 0, whose proof we adapted here 
for our needs. It also provides the fact that, the more orthogonal the input states get, 
the looser the equivalence relation between the two notions of disturbance becomes, 
until when, for orthogonal ensembles, the two definitions become inequivalent, as in 
the mentioned example. In this sense, we will have a formula which describes the 
continuous transition from the quantum theory, where information extraction implies 
disturbance (in whichever way we understand it), to classical theory, where information 
can be freely read and copied. 

As a drawback of the generality of our analysis (in particular, no symmetry is 
assumed for the input ensemble), the mathematical statement of the main result 
(Theorem [1]) will turn out to be not very efficient, mainly because of two reasons: 
the first reason is that, as we said, we will not assume any sort of symmetry which 
could simplify the analysis; the second reason is that we will relate extensive quantities 
(like entropy and entropic measures of information) with non-extensive ones (like 
fidelity and fidelity-based measure of disturbance). On the contrary, when considering 
very special cases of highly symmetric input ensembles, we will see that, as one 
would expect, it is indeed possible to obtain much more compact formulas. Such 
a simplification is achieved by considering alternative entropy-based measures of 
disturbance which can be more elegantly related with informational quantities than 
fidelities. However, a generalization of this alternative approach seems to be difficult, 
and the results we have in this direction are still largely unsatisfactory. Nonetheless, 
we present them here in Section[S]as a preliminary step towards further investigations. 

2. Notation and basic concepts 

In this section we introduce notations and recall basic concepts and facts that will be 
useful in the following. Let us consider the following common situation: some source s 
draws the letter x from an alphabet X , according to the probability distribution p{x), 
and correspondingly emits the signal state [gf- > and Tr[£»^] — 1) belonging to 
the d-dimensional Hilbert space Q. (Throughout this paper, all Hilbert spaces will be 
finite dimensional.) Such a source is modeled as a quantum ensemble 

5-^{p{x),gf}xex, (1) 

whose average state is denoted by g^ :— J^xPi-'^)^?'- ^ measurement is then 
performed onto the system Q (the input of the measurement apparatus) to get 
information about the encoded letter x G X. As proved by Ozawa in Ref. |12j . 
the most general description of any experimentally realizable quantum measurement 
is given in terms of a completely positive (CP) quantum instrument, that is, a CP- 
map-valued measure {£.^}jn^x, defined on a set of classical outcomes m G X, and 
normalized such that J2mGX is trace-preserving (i. e. a channel). When the 
input state is described by the density matrix g^, the probability of obtaining the 
outcome m is given by p{m\g) := TY[£,^{g^)]; correspondingly, the state change 
g^ 1-^ cr^l^ := £^{g^)/p{m\g) occurs, Q' denoting the output Hilbert space. With a 
little abuse of notation, it is also helpful to represent the average state change caused 
by the measurement ® as a black box performing some fixed operation on the input 
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state, namely, as a channel with hybrid quantum-classical output system 

M^{g^) := ^ P{m\g)a^^^ ® \m}{m\^, (2) 

where \m){m\'^ are a set of orthonormal (hence perfectly distinguishable) vectors in 
the classical register space X of outcomes. 

In addition, we will find it useful to introduce an auxiliary reference IZ (that 
we choose isomorphic with Q) purifying the input state into j^'''^^) and going 
untouched through the whole measurement process, in such a way that 

= ^p(m|e)S^|f ®|m)(m|^, (3) 



mex 



where 



^^Q, (id^^gg)(^^g) 

'"l^? ■ p{m\g) 



(4) 



It is clear that Tttz[Y,'^^ ] = cr^^^, for all m € X. Moreover, by denoting 
:=Trs,[S^|f],wehavc 

p{m\g)r:^^^ = Trs[*^S] =: (5) 

mex 

where ^''^^ stands for the projector |\E'^^)(\E'^®|. It means that the action of the 
instrument on Q induces the ensemble decomposition {p{m\g),T^^^}mex on 7?.. 

Even being a formally defined — hence non directly accessible — system, the 
reference TZ will play a major role in our analysis. In order to better understand its 
meaning, it is suitable to think about TZ as the remote system about which we actually 
want to extract information by measuring the system Q, the latter representing in fact 
just the carrier undergoing the measurement. Indeed, consider for example the case in 
which the state \^'^^) is a purification of some state g^. Then, all possible sources 
S = {p{x), gf'}x having average state g^ equal to g^, namely, g^ = ^^re 
in one-to-one correspondence with positive-operator-valued measures (POVM) on TZ 
through the duality relation 

p{x)gf =: TV^ [{Rf ® 1®) . (6) 

In other words, when thinking of a classical-to-quantum communication scenario, the 
system Q represents the receiver, the source s models the quantum channel x i-^ g^ 
along which the classical information is sent, while the reference system 7Z can be 
understood as the sender's system which retains the information about the encoded 
index x. 



3. Information gains and disturbances 

3.1. Information gains: mutual information, accessible information, and quantum 
information gain 

Let us suppose that we know exactly the source s, in the sense that we know the 
probability distribution p{x) and the states g^ which are associated with the letters 
X G X. In this case, given the instrument A4^, it is straightforward to compute 
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the joint probability distribution p{m, x) = Tr[£miQx)], and, from it, the correlations 
between the input letter and the measurement readout. We choose here to measure 
them as classical mutual information 

where we implicitly used the Bayes' rule p{x, m) — p{x)p{m\x) . 

Imagine now that we ignore the exact structure of the source s, but its average 
state Qs- Even in this case we can define a mutual information-like quantity that 
describes how much information the instrument is able to extract: this quantity is 
the accessible information, when the measurement is fixed, and we maximize over all 
possible ensemble s' = {p'(x'), g'^,}x'£X' with average state g^' = g^, that is 

hcc{g.,M):^ max I{X' : X). (8) 

s':es'=e!i 

This quantity is well-known to be extremely hard to compute explicitly. It is then 
useful to introduce another quantity, which is much easier to calculate, yet equivalcni[j|| 
to /acc for finite dimensional systems. In Ref. [1] , we introduced the quantum 
information gain defined as the entropic defect [T^ of the ensemble induced on the 
reference TZ by the action of the instrument on Q 

iig,,M) S{g-^) - ^ p{m\g,)S{T:^^J, (9) 

me AT 

where S{g) := — Tr[glog2 g] is the von Neumann entropy, and other notations follows 
Eqs. ©-(ini). This quantity is what we are searching for, in that it is easy to compute 
and in Appendix A we prove that 

hccig.,M) < <t((2d-l)v/2/acc(fo,Al)) , (10) 

where d :— dim Q and i{x) is a positive, continuous, monotonic increasing function 
such that t(0) — 0. (The first inequality is the Holevo bound on accessible 
information [15j.) In other words, for finite dimensional systems, /acc(f?0,A^) ^ 
if and only if t(ps, A^) — > 0. About this point, as we mentioned before, an important 
caveat is in order: the appearance of the dimension d in Eq. pUj) makes the equivalence 
relation quite weak as dimension increases. Such a factor is reminiscent of the 
phenomenon known as locking of classical correlations |27j . and it is in general 
unavoidable 

As a concluding remark, notice that, even if the reference system 7?. explicitly 
appears in the definition quantum information gain only depends on the average 
input state g^ and on the instrument Al^, regardless of the particular purification 
1*''^^) chosen. 



3.2. Disturbances: average output fidelity, entanglement fidelity, and quantum 
disturbance 

Let us suppose that we know exactly the source s, in the sense that we know the 
probability distribution p[x) and the states gx which are associated with the letters 
x & X. Moreover, let us suppose that we know exactly the quantum instrument M^, 
in the sense that we know the outcome set X and the set of associated CP-maps 
X. Having access to the measurement readout m, but not to the input letter 

I Via bounds that, being dimension-dependent, must be handled with care: sec the following. 
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X being unknown to the receiver, the best (deterministic) correction we could try in 
order to reduce state disturbance, is in principle to engineer, depending on the source 
and on the instrument, a set of correcting channels {TZ^ }rn^x, mapping Q' back to 
Q, and to apply them onto the output state according to the measurement readout 
m. The maximum output fidelity approach aims at finding channels 7?,^ achieving 
the quantity 



■gEpwM ""™' ..?!. (11) 



for all m G X, where V{q,(j) :— {Tx\^y/a\)'^ is the fidelity. We however relax this 
condition by considering a situation where we are content to see that just the average 
value 

T^(s,M) max ^ p{x)f i ^ {S^[Qf)\ Qf] (12) 

is close to one. This means that, whenever Fav < 1, there exists at least one state Qx 
which gets irrecoverably disturbed by the corrected channel X^m^™ ° 1^ turns, 
due to concavity of fidelity, this means that, for such state g^, there exists at least 
one outcome m for which the corrected fidelity (|lip is strictly less than one. 

Suppose now that we instead ignore the exact structure of the source s, but know 
its average state g^i only. In this case, the entanglement fidelity [20] approach provides 
a suitable definition for disturbance. The quantity to consider in this case is 

maxFfi^^i^^^|4ifl(^,H (13) 

for all possible outcomes m G X, where, as usual, j^*^^) is a purification of the 
average state g^. As before, we relax a bit the request, also to simplify the notation, 
and focus on the average quantity 

mg.^M) max F ( ^ (id^®7^2'o£:2)(vI/^2),^P^2 ) . (14) 

Incidentally, notice that, in this setting, the restoring channels TZ^ cannot depend on 
the ensemble s, but on its average state g^ only. A value close to one for Fe{gs,M) 
means that, on the support of g^, the channel J2m<£X ° close to the identity 

channel id®. This in turn implies that, not only states are transmitted with high 
output fidelity, but also entanglement can be reliably sent. Entanglement fidelity is 
known to provide a lower bound on the worst possible average output fidelity, in the 
sense that 

'K{g„M)< min F^{5',M). (15) 

This is the reason why the entanglement fidelity criterion is generally much more severe 
than the average output fidelity criterion. The example provided in the introduction 
exhibits the extreme case in which the two criteria are strictly inequivalent. Notice that 
a value < 1 means that there is at least one outcome m for which the corresponding 
state change map is irreversible on the support of g^ . 
The quantum disturbance, defined in Ref. [4 as 

6{g,,M) := S{g?) - J.^^^'a- f^-^TZ ^ M^)i^'^^)) , (16) 
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where I^^^ {g'^^) := S{g^) — S{g'^^) is the coherent information from subsystem 
to subsystem B for the bipartite state g"^^, is a quantity bounded between zero and 
S{g^) and it is equivalent to the entanglement fidelity (I14p . in the sense that [4] 

i {l-F:{g.,M)Y < S{g,,M) < g(l - A^)), (17) 

where g(a;) is some positive, continuous, monotonic increasing function such that 
g(0) — 0. Moreover, the tradeoff relation is always obeyed [1] 

iig.,M)<dig,,M), (18) 

where Al) is defined in The equality in is achieved by "single-Kraus" or 
"multiplicity free" instruments, that are those for which every map £„i defining the 
instrument A4 is represented by a single contraction, that is, £m{g) — EmgE^^^. In this 
sense, single-Kraus instruments can be considered as least-disturbing measurements. 
At this point, the question remains open whether or not quantum disturbance S{gs,A4) 
is equivalent to average output fidelity Fav (s, Ai) as well, and, if it is, to which extent — 
in fact, they cannot be always equivalent, as the von Neumann measurement example, 
for which Fav{5,M) = 1 while 6{g^,A4) = S{g5), proves. To answer this question will 
be the aim of the next section. 



4. Unifying disturbances for irreducible ensembles of pure states 

Let us suppose now that the source s produces pure states, that is, g^ — for 
all X G X. Now, let us consider all possible N -complete paths, namely, all possible 
A^-sequences = (xi, 2:2, • • • , xn) G X^^ such that the corresponding set of states 

{IV'.^),I^.^>,---,I^.^„)} (19) 
contains, at least once, every state emitted by the sourcfl Let us denote the set 
of all possible iV-complete path for a given source s as and let = Utv ^s' 
the set of all possible complete paths. Let us moreover define the following function 
of a given ensemble s 

77 s := max , 20 

where the maximum is taken over all possible complete paths. Following Ref. [8], we 
say that the source s is irreducible if and only if 77(5) > 0, that is equivalent to say that 
the states IV'i^) cannot be divided into two or more orthogonal subsets. The notion 
of irreducibility makes rigorous the notion of "non-orthogonal ensembles" used in the 
introduction: in fact, within a given ensemble of pure states, some of them could well 
be orthogonal, yet the ensemble being irreducible. Notice moreover that, implicit in 
Eq. ()20|) there is another tradeoff: we can make the numerator larger by considering 
"more complicated" paths, but, eventually, also the denominator gets larger. Hence, 
to find the optimum value of ry(s) is a tricky procedure. However, for sources emitting 
a finite number of states, the solution (maybe corresponding to non-unique choices of 
optimal complete paths) always exists. 

The quantity 77(5) is a good indicator of the degree of irreducibility, or 
quantumness, of the ensemble s. There is however a hidden subtlety: till now wc did 

^ We could relax this condition by asking valid complete paths to form spanning sets for Supp(gp). 
This technicality, however, even being useful is some cases, would force our analysis to be notationally 
heavy. Wc hence prefer to loose a bit in the efficiency of our bounds while gaining in clarity. 



Towards a unified approach to information- disturbance tradeoffs 



9 



not consider the role of the probabihty distribution of the source yet. Imagine, for 
example, that a source is irreducible, in the sense that r]{s) > 0, but only because of 
one state |^0) being non-orthogonal with any other, whose probability of occurrence 
p(0) is however very small. Such a source is then irreducible, but only "weakly" 
irreducible: we should take this possibility into account. We then define the function 

C(s) := 77(5) • minp(a:) . (21) 

Again, since by definition min^; p{x) > 0, the source of pure states s is irreducible is and 
only if C(s) > 0, but now, contrarily to what happens for r/(s), we are automatically 
weighing the role of the probability distribution p{x). In the following theorem we see 
how C(s) enters in our discussion: 

Theorem 1 Let 5 — {p{x), \iIj^)}x£X be an ensemble of pure states, with average 
state g^, undergoing the quantum measurement described by the instrument A4^. 
Then 

i(i-^^(5,Ai))' < i(l-^;(^.„X))' (22) 

<S{g,,M) (23) 

where f(x) is some positive, continuous, monotonic increasing function such that 
f(0) = 0. □ 

In other words, whenever the ensemble s is irreducible, namely, C(s) > 0, then the 
notions of disturbance originating from average output fidelity, from entanglement 
fidelity, and from quantum disturbance are all equivalent, in the sense that 
Fav{s,M) Fe{gs,M) — > 1 44> d{gs,M) — > 0, and, equivalently, Fav{s,M) < 

1 Fe{gsi,M) < 1 <^ S{gs,M) > 0. On the other hand, as C(s) ^ (a situation 
happening when 77(5) — > or niinj;p(x) — > 0), while a high entanglement fidelity 
always implies a high average output fidelity, the converse direction becomes weaker 
and weakefl The example we mentioned in the introduction (involving orthogonal 
states and von Neumann measurement) where the two disturbances are irreconcilably 
inequivalent, even looking somehow over-simplified, actually captures all the essential 
features of those situations where average output fidelity and entanglement fidelity 
are really inequivalent quantities. 

Proof of Theorem [TJ Inequalities ([^^ and follow directly from Eqs. (fT5)) 
and (fT7|) . The last inequality is a consequence of the result proved in the 
Appendix B, which in turn follows the proof of Lemma 3 in Ref. ,8J. In fact, the 
quantum disturbance 6(gs,M) is defined as the coherent information loss of the 
channel as it is defined in Eq. mapping the input system Q in the output 

quantum-classical system Q'X. Moreover, whatever channel TZ^ , applied after 
in order to map the hybrid output system Q' X back to the input system Q, returns 
in fact a set of channels {TZ^ }m£X, each of them applied conditionally on the readout 

+ As noted in a previous footnote, by allowing a generalized notion of complete paths, we could also 
get rid here of signals — states with their corresponding probabilities — which are not necessary to span 
the subspace Supp(p5), and possibly obtain an effective parameter ^(s) which is larger than the one 
defined in Eq. I|2f p . For the sake of clarity, however, we will leave this improvement as an exercise 
for the interested reader. 
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m, all of them mapping Q' back to Q. This is indeed the way in which we defined 
Fav{5,M) in Eq. (fT2l) . Therefore, we can straightforwardly apply the statement of 
Proposition 1 in Appendix B and get Eq. ([24|) . ■ 



5. Entropy-based notion of average disturbance 

The main content of Theorem [T] is to make explicit the relation existing between 
disturbances defined in terms of average output fidelity and entanglement fidelity. 
Quantum disturbance appears there because of Eq. pT|) , which states that quantum 
disturbance is essentially equivalent to entanglement-fidelity-based disturbance. It is 
now reasonable to ask whether one can devise an entropy-based analog also for average- 
output-fidelity-based disturbance, in the hope that the relation existing between such 
a new measure of disturbance and quantum disturbance can be expressed in a form 
simpler than Eqs. ([^ - ((M)) . 

Let us consider an input ensemble s := {p{x), gx}xex undergoing a channel £. It 
is known that its entropy defect x(s) := S{gs) — J2xPi^)^i(^^) behaves monotonically 
under the action of a channel, that is, the entropy defect loss 

Ax(5,£) :=x(s)-x(f(s)) (25) 
is always nonnegative. Moreover (see Ref. and Lemma [2] in Appendix B[ ) it is 



known that the condition Fav{5,£) — *■ 1 implies that Ax{s,£) — > correspondingly. 
On the other hand, the opposite direction is known to hold only in the exact case, 
that is, Ax{5,£) ~ imphes that Fav{5,£) — 1 (see for example [S^)- In other words, 
at the moment we cannot state that entropy defect loss and corrected average output 
fidelity are truly equivalent measures of disturbance, since we do not know whether a 
small entropy defect loss implies a correspondingly small average fidelity loss. 

Anyway, let us postulate for the moment that entropy defect loss Ax(s,£) = 
is the sought informational analog of average output fidelity, much like quantum 
disturbance represents the informational analog of entanglement fidelity. In the case 
of a quantum measurement process, when wc consider the "channelized" action of the 
instrument M as given in Eq. ([2]), it is straightforward to compute 

x{M{5))^IiX ■.X)+J2pim)x{5m), (26) 

m 

where /(-X" : X) represents the (classical) mutual information between the input letter 
and the measurement outcome, as defined in Eq. ([7]), and 

s„,:=|p(x|m),%%l| (27) 



p{m\x) 



is the ensemble output by the measuring apparatus, given the outcome readout m. 
Notice that, for every conditional output ensemble Sm, the average state aj^_^ is 

a,„:^5:p(^H^^ = %%l, (28) 

according to Bayes' rule. Then, the entropy defect loss is equal to 
AX(5,A1) : ^ x{b) - X{M{5)) 

= Y.p{m)[x{s)~~x{^m)]~I{X:X). (29) 
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Let us now focus on a very special case of input ensemble (introduced in Ref. [33] 
and generalized in Ref. [34] )• It is possible to define such an ensemble for any given 
ensemble average state g^i as follows: given the average state on system Q, after 
having purified it as j^*^^) by means of a reference system TZ. isomorphic to Q, one 
defines the pure states along with their occurrence probabilities p{x) as 



In the above equation, the vectors {(j)^}x£X constitute a rank-one POVM on TZ with 
2d elements, such that Tr[(/)a;] = l/2d for all x, and defined as follows: d of them 
form a basis diagonalizing TrQ[5'''^^], while the other d vectors form another basis, 
mutually unbiased with respect to the first one. 

For this particular input ensemble of pure states, that we call of Christandl- Winter 
type, it is known that [331 131] 



that is, entropy defect loss and quantum disturbance are equivalent quantities, and 
their relation is strikingly more stringent than the relation existing between average 
output fidelity and entanglement fidelity as given in Theorem[T| However, as we stated 
in the introduction, the extension of this approach to the more general case where the 
input ensemble does not come from such a highly symmetric construction seems to be 
a hard path to pursue, yet worth further investigations. 

6. Conclusion and discussion 

With Theorem 1 we showed that, whenever the input source is sufficiently rich in 
its structure — i. e. it is irreducible — the vast majority of definitions of disturbance, 
introduced in the literature so far, actually turn out to be equivalent. In particular, 
we focused on the relations existing between average output fidelity and entanglement 
fidelity, without imposing any symmetry on the source distribution. Our analysis 
could hence unify different approaches to the quantification of information-disturbance 
tradeoffs in quantum theory. The general bounds obtained here exploit Fannes-like 
bounds on entropic quantities, and, for this reason, relate extensive quantities with 
non-extensive ones. In the very particular case of input ensembles of the Christandl- 
Winter type, we showed a way to circumvent this difhculty and obtain much more 
stringent bounds. A future research direction is to extend this latter approach to 
more general situations. 

Acknovirledgments 

F. B. acknowledges Japan Science and Technology Agency for support through the 
ERATO-SORST Quantum Computation and Information Project. M. H. is supported 
by EC IP SCALA. Part of this work was done while F. B. was visiting the Institute 
of Theoretical Physics and Astrophysics at the University of Gdansk. 

Appendix A. Accessible information and quantum information gain are 
equivalent criteria to measure information gain 

We exploit here a method introduced in Ref. 25J. Let us consider an ensemble 
•5 — {pi^)y 9x}xex- The accessible information is defined as the maximum over all 



p{x)^^ := Tr7t[((/)f ® 1^) 



(30) 



Ax{s,M) < S{g„M) < 2Ax{s,M) 



(31) 
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possible POVM's of the mutual informatiorQ 

/acc(s) := max I{X : X). (A.l) 

Now, let x(s) be the entropy defect of the ensemble s 

X(s) (A.2) 

X 

where S{q) :— —TT[glog2 g] is the von Neumann entropy. Holevo bound states 
that [15] 

/acc(s) < X(S)- (A.3) 

By rearranging terms in Eq. (|A.2p , we can write 

X(s) = ^p(2:)(5(&)-5(fe)). (A.4) 

X 

By Fannes inequality [23] , we know that 

x{B) = J2p{x)iSig,)-S{g,)) 

X 
X 

< ^p{x)t{\\gx - foil) 

X 

<i(^p{x)\\gx-g4i^^ (A.5) 

where t(a;) is a positive, continuous, monotonic increasing, concave function such that 
t(0) = and :— Tr \X\ is the trace-norm. In particular, for a; < 1, we can take 



tix)=xlog2^ , (A.6) 

where d is the dimension of the underlying space ^30j . 

Let us now make a small detour, and introduce so-called informationally complete 
POVM's, namely, those POVM's whose elements also form an operator basis. This 
means that, being {Pm}m an info-complete POVM, there exists a dual frame {Km}m, 
with Hermitian operators Km, such that 

X-^Tr[XP„]if™, (A.7) 

m 

for all operators X. We consider here a particular info-complete POVM, that is, 
{Pg}geG defined as 

Pg := ^Ug^Ul (A.8) 

where Ug is a unitary representation of the group §U(d), and is a d-dimensional 
pure state. In Ref. the canonical dual frame has been explicitly calculated, and 
it holds that 

\\Kg\\^^2d-l, Vg. (A.9) 

* In fact, in the text we came to accessible information from the dual point of view, where the 
measurement (i. e. the POVM) is fixed and the maximization is done with respect to the input 
ensemble: thanks to the POVM/ensembles duality relation (|6]l the two approaches are equivalent. 
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Let us go back to out original aim, i. e. to give an upper bound to xi^)i ^'^d 
continue from Eq. (IA.5|) 



^5 111 



(A.IO) 



g 

<iU2d-l)Y,p{x)Y,\\p{g\x)-p{g)\\A 

\ x g / 

< t ((2d - l)v/2/(X : G)) (A.ll) 

<t((2d-l)v/2/acc(5)) . (A.12) 



Eq. (jA.10[) uses the expansion formula (|A.7p specialized to the info-complete 
POVM (jXsl) . while to obtain Eq. (jXTT]) we used Pinsker inequality 

\\Q-a\\l<2D{g\\a), (A.13) 

where D{g\\a) := Tr[gilog2f3— gilogjcr] is the quantum relative entropy. Eq. (jA.12p 
comes from the very definition of accessible information, which is defined as the 
maximum over all possible POVM's. 
In conclusion, we have that 



4, 



:(s)<X(s)<t((2d-l)v/2/acc(s)), (A.14) 

that means that, for finite dimensional systems, /acc(s) — > if and only if x(s) 0. 
In other words, accessible information and entropy defect are equivalent quantities. 
Notice that in passing by we actually proved that 

^p(x) life - foil < (2d - l)V2/acc(s). (A. 15) 

X 

Such a bound is a little better than the one proved, with different techniques, in 
Ref. [37]. 



Appendix B. Average output fidelity and entanglement fidelity are 
equivalent criteria to measure disturbance for irreducible pure states 
ensembles 

In the following, let us consider a channel, that is, a completely positive trace- 
preserving map £ : Q ^ Q', and a state g which undergoes £. As it is often done, 
we proceed here with the tripartite purification of the whole setting. The first step 
is to purify the input state g^ into a bipartite pure state \^'^^), shared with a 
reference system TZ, as we did in Section 2. The second step consists in constructing 
the isometry V : Q ^ Q' A. (the isometric condition reads V^V = 1®) such that 

£{g) = TyA[VgV^], (B.l) 

for all input state g. The auxiliary system is sometimes called ancilla (or 
environment) and the construction in Eq. (jB.ip is usually referred to as the 
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Stinespring's dilation of £. Moreover, Stinespring's dilation induces an ancillary 
channel £ : Q ^ A. defined as 

£{Q):=TyQ'[VgV% (B.2) 

for all g. The channel £ is defined up to local isometric transformations on A.. By 
combining these two "purifications" , we can write the tripartite pure state 

|T^2'A) := (1^ (g> (B.3) 

such that Tr7?:A[T'^S'-^] = £{g). 

For our later convenience, we now introduce two useful notation. Given an 
ensemble s :— {p{x), gx}x, we denote with x(s) its entropy defect, in formula 

x{B):^S{g,)-yp{x)S{gx), (B.4) 



where S{g) := —TT[g\og2 g] is the von Neumann entropy. By the monotonicity 
property of quantum relative entropy, after the action of a channel, the entropy defect 
drops, and we denote the difference between input x and output x, 

Ax(s,f) :=x(s)-x(£(s)). (B.5) 

We also define the coherent information loss S{g,£), due to the action of a channel 
f ^ : Q — > Q' onto a state g^ which is a subsystem of a larger entangled pure state 
as 

Sig,£) :^S{g^)-I^-^' [{id^ ® £^)i^'^^)) , (B.6) 

where I^^^{g'^^) := S{g^) — S{g'^^) is the coherent information from subsystem 
A. to subsystem B for the bipartite state g'^^. We can now state the following 

Lemma 1 Let £ : Q ^ Q' be a channel. Lets :— {p{x), \ipx)}x be an input ensemble 
of pure states. Then, 

6ig,,£) = Ax{s,£)+x{£is)), (B.7) 

that is, coherent information loss equals the sum of input ensemble entropy defect loss 
plus the entropy defect of the induced ancillary ensemble. □ 

Proof. The proof simply follows by direct inspection, as done in Ref. [22]. Notice 
that the condition of having pure input states is crucial. ■ 

Now, analogously to Eq. (|12p . let us define the corrected average output fidelity 
for a channel £ : Q —> Q' (there are no classical outcome here, only quantum, as the 
former is a special case of the latter) when the input ensemble is s :~ {p{x), gx}xi£X 
as 

T]^{s,£) := max ^ p{x)F(n{£{ Qx)),gx^, (B.8) 
xex 

where the maximum is taken over all possible channels TZ mapping Q' back to Q. 

Lemma 2 We have that 

Ax(5,£) <fi(l-T^(s,£)), (B.9) 

where fi(x) is a positive, continuous, monotonic increasing, concave function such 
</iai fi(0) = 0. □ 
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Proof. Following Ref. [21], given two sources of K states s := {p{x),gx}x and 
s' := {pix),g'x}x, we can apply the refined Fannes' continuity relation for von 
Neumann entropy [30] and get 

|x(s)-x(s')l <2A-^/ilog2^, (B.IO) 

e 

where e 1 — J2xP(^)^(Px^ P'x)^ ^o^' the fidelity defined as f{p,cr) '■— {'Tr\y^y/a\)'^ . 
The statement is simply recovered by specializing such a relation to the ensembles 
s and TZ{£{s)), with fi(x) := 2K^\og2{dQ/x), for sufficiently small x (typically 
x<2/e^). U 



Lemma 3 We have that 



where for a given ensemble s, is defined in Eq. i21\) . and hix) is a positive, 

continuous, monotonic increasing, concave function such that f2(0) =0. □ 

Proof. We carefully follow here the proof of Lemma 3 given in Ref |8j. Noticing 
that we can always take < dg, and denoting e := 1 — Favis, £) and e' := > e, 
provided that e' < 1 we have that 

x(f(5)) <47V\/^log2^, (B.12) 

where iV > is the number of states in the optimal path in Eq. ([20]) , which is always 
greater of equal than the number of states K emitted by the source. ■ 

Finally we obtained the following 

Proposition 1 Let £ : Q ^ Q' be a channel. Let s := {p{x), \i/Jx)}x bs an input 
ensemble of pure states. Then, provided that Fav{5,£) > 1 — (C(s))^ and that Fav{s,£) 
is sufficiently close to one, we have that 



, 'l-FUs,£)\ 
6{g.,£)<i{^ , (B.13) 

where f(x) is a positive, continuous, monotonic increasing, concave function such that 
f(0) — 0. Ln other words, whenever the ensemble S is irreducible, that means C(s) > 0, 
the condition Fav{5,£) — > 1 implies that 6{qs,£) ^0. □ 

Proof. Simply by combining Eqs. (|B.7p . (jB.lOp . and (|B.12p . and noticing that e' > e 
and N > K we see that 

S{g„£)<6NV?\og2^, (B.14) 

for 



which is the statement. ■ 



1-Fayi5,£) 

-, (B.15) 
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